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Progress for the Third Reporting Period 

During the third reporting period our efforts were focused on a reformulation of the 
optimal control problem involving active state-variable inequality constraints. In the 
reformulated problem the optimization is carried out not with respect to all controllers, but 
only with respect to asymptotic controllers leading to the state constraint boundary. 

Intimately connected with the traditional formulation is the fact that when the reduced 
solution for such problems lies on a state constraint boundary, the corresponding boundary 
layer transitions are of finite time in the stretched time scale. Thus, it has been impossible 
so far to apply the classical asymptotic boundary layer theory to such problems 1 . 
Moreover, the traditional formulation leads to optimal controllers that are one-sided, that is, 
they break down when a disturbance throws the system on the prohibited side of die state 
constraint boundary. - 

In our second reporting period we tried to remedy these problems by using two 
successive transformations 2 . The first (Valentine's) transformation converts the state- 
variable inequality constraint into an equality constraint but at the expense of introducing a 
singular arc along the reduced solution. The second (Goh's) transformation then converts 
to a regular optimization problem. Although this procedure appeared to supply a 
mechanism for the construction of linearized boundary layer corrections (in Goh's 
variables), the resulting solutions (in Valentine's variables) consisted of impulse like 
controls and step-function-like fast states in the (linearized) boundary layer. This meant that 
the resulting (linearized) boundary layers (in Valentine's variables) supplied no additional 
information over that supplied by the reduced solution. Closer examination of the whole 
procedure in a general setting revealed that this behavior was rather not surprising and 
intimately connected with the singular arc. Specifically, any attempt to obtain a linearized 
boundary layer correction about a singular arc results in a (trivial) zero-time boundary layer 
which happens to be just a step function. 

The difficulties encountered during the second reporting period led us in the first half 
of the third reporting period (see Appendix A) to give up on Goh's transformation and try 
to invent an alternative near-optimal procedure for treating such problems. The idea that we 
came up with was to use a Valentine-like transformation to transform to new variables, 
such that, in the transformed system no trajectory violated the constraint, and, the reduced 
solution that rides the state constraint boundary corresponded to a singular arc at infinity. 
Linearized boundary layer expansions about such a singular arc were, just as expected, 
trivial step functions. By expanding however about a finite arc (that was nonsingular) we 
were able to obtain a nonlinear, two-sided feedback control law that tracked the reduced 
solution asymptotically. This corresponded to allowing -in the boundary layer only- two- 
sided perturbations of a prescribed size beyond the state constraint boundary. Our 
procedure was nearly- optimal in the same sense as feedback linearization, that is, because 
we were tracking a nearly-optimal reduced solution that usually tends to dominate the 
performance index if the final time is large compared to the characteristic times of the 
boundary layers. However, our procedure did not imply any optimality in the boundary 
layer and did not result in any uniformly valid approximation of die exact solution, because 
as the finite arc approached the singular arc at infinity the boundary layer degenerated again 
into a step function. For more details on this effort the reader is referred to Appendix A. 
Although the effort proved unsatisfactory, it made us look into the problem from a new 
perspective, which eventually, as we explain in the remainder of this report, did lead to 
some fruitful results. 
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Our futile attempts during the first two reporting periods and the first half of the third 
reporting period to combine the classical asymptotic boundary layer theory with die 
traditional formulation of optimal control problems involving active state-variable inequality 
constraints led us in the second half of the third reporting period to a complete 
reconsideration of the main issues. Above all was the realization that the way in which 
these problems were formulated resulted in finite-time, one-sided feedback control laws 
that were useless from a practical point of view. We suspected that the reason we were not 
getting the right answers was that we were not asking the right questions. We gradually 
realized that what we really had to insist upon and retain was the asymptoticity and the two- 
sidedness of our feedback control laws, not their optimality, defined according to these 
traditional standards. After all, these two features are both necessary (but not sufficient) to 
make a control law attractive from a practical point of view. Note that such a conflict never 
arises when a problem does not involve any active state-variable inequality constraints. In 
such a case the classical asymptotic boundary layer theory leads to feedback control laws 
that are both asymptotic and two-sided as desired. Thus, the idea that dominated the 
second half of the third reporting period was that, if we were to retain the asymptoticity and 
the two-sidedness of our control laws when solving optimal control problems involving 
active state- variable inequality constraints, then, we somehow had to build these two 
features explicitly into the problem. This meant that we would have to reformulate the 
boundary layer problem so that the optimization is carried over all controllers that lead 
asymptotically to the state constraint boundary (corresponding to the reduced solution) and 
do not break down when a perturbation throws the system on the prohibited side of the 
boundary. 

To accomplish the above task one will first have to isolate the asymptotic, two-sided 
controllers from the rest. It is in this specific area that we have made significant progress 
during our third reporting period 3 . Although the details are given in our draft paper 
prepared for the 1993 AIAA Guidance and Control Conference (see Appendix B), we will 
hint at our ideas here by considering the dynamical system: 


dx 

dt 

= f(x, y f u) 

O 

X 

II 

O 

'x 

(i) 

dy 

dt 

= g(x, y, u) 

y( l o> = y 0 

(2) 


where x, f are vectors of the same dimension, and y, g and u are scalars. We would like to 
describe the set Coi all piecewise continuous (in time) control laws u(x(t),y(t),t) that track a 
given hypersurface in the state space of the above system, given by the scalar equation: 

S(x, y) = 0 (3) 

that is, if u=u(x,y,t) is a specific control law belonging to C, then the system (1), (2), 
driven by u(x,y,t) for t > to (and assuming that S(x 0 ,yo) is not zero) will eventually reach 
the hypersuiface given in (3) and stay on it thereafter. A control law can drive the system 
onto the hypersurface (3) either in finite time or asymptotically. Accordingly, the set C is 
the union of two disjoint sets J and A. The set 7 contains all control laws that track 
hypersurface (3) in a finite time. The set J3 contains all control laws that track hypersurface 
(3) asymptotically. We would like to give a complete description of the sets 7 and 

Let us denote by Z the set of all piecewise differentiable, scalar functions of the real 
variable a, defined and invertible for all a in [0,1], and satisfying the boundary conditions: 
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z(0) = 0 ; 


z(l) = -S(x 0 , y 0 ) 


(4) 


We assume that the range of S is contained in the range of z for all z in Z. We consider a 
as a time variable and make the transformation from y to a: 

z(a) + S(x, y) = 0 (5) 

Differentiating (5) with respect to time and using (1) and (2) results in: 

We now let dot/dt play the role of a new control, P, by defining: 

lf = p O) 

then, (6) becomes: 

^||3+|| f (x,y,u) + ^-g(x,y, u )= 0 (8) 

We will assume for the sake of simplicity that the hypersurface (3) is first order in u, that 
is, the total time derivative of S(x,y) is explicitly dependent on u. We also assume that (5) 
is invertible in y and that (8) is invertible in u, resulting in the two equations: 

y = h(x, z) (9) 


u = k(x, z, ^ p) (10) 

for y and u respectively. System (1) and (2) has now been transformed through (5)-(10) to 
the equivalent system: 

^- = f(*,h(x.z(o)),k(x,z(a),^-R)) x(t„)=x 0 (11) 

<x(to) = 1 (12) 


If we now want to drive our original system (1), (2) onto the hypersurface (3) in a 
specified finite time tf, then all we have to do is use the control law for the transformed 
system 


o 

tM 

1 

M 

OQ. 

for 

t 0 <t <t f 

(13) 

p = 0 

for 

t f <t 

(14) 

that leads to the time variation for a: 

tf-t 

a = t _ t 
l f l o 

for 

t 0 < t < t f 

(15) 


3 



a = 0 


(16) 


for t f < t 


Thus, a is driven from 1 to 0, in to < t < tf and stays at zero for tf < t. Accordingly, 

due to the boundary conditions (4) on the function z(a), our original system (1), (2), is 
driven from its initial state at t=to onto the hypersurface (3) in to < t < tf and stays on the 
hypersurface for tf < t. The feedback controller u(x,y,t) that will perform this task for 

system (1), (2) can be found from (10): p is given by (13), (14), z is equal to -S(x,y) from 

(5), and since z(a) is invertible on [0,1], dz/da can be expressed as a function of z and 
therefore of -S(x,y). 

Similarly, if we want to drive our original system (1), (2) onto the hypersurface (3) 
asymptotically, then we can use the control law for the transformed system 

P = - a (17) 

that leads to the exponential time variation for a: 


Now, a is driven from 1 to 0 exponentially. Again, due to the boundary conditions (4) 

on the function z(a), our original system (1), (2), is driven asymptotically from its initial 
state at t=to onto the hypersurface (3). The feedback controller u(x,y,t) that will perform 

this task for system (1), (2) can be found from (10): P is given by (31), -S(x,y) is equal to 

z from (5), and since z(a) is invertible on [0,1], dz/da can be expressed as a function of z 
and therefore of -S(x,y). 

We can thus claim that fixing the control P as in (17), or as in (13), (14) establishes a 

correspondence between functions z(a) in Z and controllers u(x,y,t) in or J that bring 
system (1), (2) onto the hypersurface (3) asymptotically or in finite time respectively. As 
proven in the draft paper 3 for example (see Appendix B), once P is fixed as in (17), for 
every z(a) in Z there corresponds a controller u(x,y,t) in SL, and, more importantly, for 
every controller u(x,y,t) in J3, there corresponds a function z(a) in Z The correspondence 
therefore established between & and Z through the selection of P as in (17) is onto. Similar 
facts hold for the sets Zand J once P is fixed as in (13), (14). 

These results suggest that in an optimization problem in which the asymptotic tracking 
of a hypersurface is desired (like for example in a boundary layer problem that corresponds 
to an optimal control problem involving an active state-variable inequality constraint) we 

can forget about the "real" controllers u(x,y,t), fix the control P as in (17), and optimize 
over all functions z(a) in Z This amounts to carrying out the optimization for the original 
problem only over all (asymptotic) controllers in PL The resulting off-line optimization 
problem is guaranteed to have an infimum if the original problem does have a minimum 3 . 
An example that we supply in the draft paper 3 (see Appendix B) suggests that it may be 
possible for this off-line optimization problem not to achieve its infimum, that is, the 
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infimum may correspond to a finite-time controller (that belongs to J but not A). The 
question that arises naturally in such a case is how to find an asymptotic controller that 
somehow approximates this infimum. Although there are no general answers yet, in our 
draft paper 3 we were able to show that by imposing an additional isoperimetric constraint 
on the reformulated problem one can, at least for this particular example, select an 
asymptotic controller that approximates this infimum. 


Conclusions and Future Research 

The class of all piecewise continuous (in time) controllers that track a given 
hypersurface in the state space of a dynamical system can be split into two disjoint classes. 
TTie first class contains all controllers that track the hypersurface in finite time. The second 
class contains all controllers that track the hypersurface asymptotically. A transformation 
technique can be used to give a complete description of both classes. This splitting of the 
two classes can be used to reformulate optimal control problems involving active state- 
variable inequality constraints. The optimization in the reformulated problem is carried out 
over the class of asymptotic controllers only and not over the class of all controllers. If a 
minimum over all controllers exist, then the reformulated problem is guaranteed to have an 
infimum. An example suggests that the reformulated problem does not achieve its infimum, 
that is, the infimum corresponds to a finite-time controller. By imposing an additional 
isoperimetric constraint on the reformulated problem one can, at least for this particular 
example, select an asymptotic controller that approximates this infimum. 

Our primary goal for the immediate future is to generalize the results obtained for the 
particular example of the draft paper 3 to a general boundary layer system, corresponding to 
an optimal control problem involving active state-variable inequality constraints. We plan to 
examine in this general setting whether and when the off-line optimization problem 
corresponding to such a system achieves its infimum, that is, whether and when can one 
find an asymptotic controller that results in this infimum. For the cases in which this 
infimum is not achievable with an asymptotic controller, we would like to find out under 
what additional conditions can one approximate it (with an asymptotic controller). The 
isoperimetric constraint that we used in the example in our draft paper should provide us 
with some insight. Furthermore, we plan to extend our theory to the cases in which the 
hypersurface described by S(x,y) is n-th order in u, that is, one needs to differentiate 
S(x,y) n times with respect to time to get explicit dependence in u. Finally, we plan to 

examine whether the correspondence established in our draft paper between functions z(a) 
in Z and controllers u(x,y,t) in .3 or y is one-to-one. Since we showed that it is onto, if it 
turns out to be also one-to-one, this would mean that it is a bijection. Such a result would 
be important because it would imply a complete equivalence between the set Z and the sets 
j?or y. 
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Appendix A 

Progress for the First Half of the Third Reporting Period 

In this Appendix we supply a more detailed account of the work conducted during the 
first half of the third reporting period, namely, between 7/5/92 and 10/1/92. Our main 
objective during this effort was to somehow combine the classical asymptotic boundary 
layer theory with optimal control problems involving active state-variable inequality 
constraints. We supply the work as an Appendix because it proved unsatisfactory. 
However, it made us look into the problem from a new perspective, which eventually, as 
we explain in the main part of our report, did lead to some fruitful results and a conference 
paper. The reference numbers in this Appendix refer to the reference list on page 6. 

During the first half of the third reporting period our efforts were focused upon 
developing a method for applying the asymptotic boundary layer theory to optimal control 
problems involving active state variable inequality constraints. The main theoretical issue is 
that when the reduced solution for such problems lies on a state constraint boundary, the 
corresponding boundary layer transitions are of finite time in the stretched time scale. Thus, 
it has been impossible so far to apply the traditional asymptotic boundary layer theory to 
such problems. In our second reporting period we tried to remedy this problem by using 
two successive transformations 2 . The first (Valentine's) transformation converts the state- 
variable inequality constraint into an equality constraint but at the expense of introducing a 
singular arc along the reduced solution. The second (Goh's) transformation then converts 
to a regular optimization problem. Although this procedure appeared to supply a 
mechanism for the construction of linearized boundary layer corrections (in Goh's 
variables), the resulting solutions (in Valentine's variables) consisted of impulse like 
controls and step-function-like fast states in the (linearized) boundary layer. This meant that 
the resulting (linearized) boundary layers (in Valentine's variables) supplied no additional 
information over that supplied by the reduced solution. Closer examination of the whole 
procedure in a general setting reveals that this behavior is rather not surprising and 
intimately connected with the singular arc. Specifically, any attempt to obtain a linearized 
boundary layer correction about a singular arc results in a (trivial) zero-time boundary layer 
which is nothing else but a step function. 

The above difficulties encountered during the second reporting period led us to keep 
Goh's transformation aside and try to invent an alternative near-optimal procedure for 
treating such problems. The idea that we came up with and that we will try to expand upon 
in this report is the following: 

Suppose that in an optimal control problem we are imposed with a state-variable 
inequality constraint of the form: 

S(x,y)<0 (1) 

where x and y are the slow and fast state variables respectively. Then, we can eliminate the 
constraint by using the transformation 

e a + S(x,y)=0 (2) 

to transform from the (x,y) plane to the (x,a) plane. Any trajectory in the (x,a) plane is a 
trajectory that does not violate (1). Moreover, in the (x,a) plane a plays the role of the new 
slow state variable. A reduced solution in the (x,y) plane that rides the constraint (1) 
corresponds to a singular arc at infinity in the (x,a) plane. Linearized boundary layer 
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expansions about such a singular arc are just (trivial) step functions. However, if in the 
(x,a) plane we instead track an arc a=ao, where oq is a finite negative number, then, due 
to the properties of the transformation (2), even for moderate magnitudes of ao, in the (x,y) 
plane we are sill practically tracking the reduced solution S(x,y)=0 (that is, we are nearly 
optimal) and more importantly, since the arc a=cto is non-singular, it affords a non-trivial, 
asymptotic, linearized boundary layer expansion about it For example, if in the (x,a) plane 
we track the arc a=ao=-3, then in the (x,y) plane we are tracking the arc S(x,y)=-0.05, 
which is pretty close to the optimal arc S(x,y)=0. 


In order to demonstrate the applicability of the above idea we will now try to apply it to 
Example 1 of Ref. 1 . In this example we are given the singularly perturbed system: 


dx 7 

— = y - w 1 
dt J 

(3) 

dy 

£ dT =U 

(4) 

and the state-variable inequality constraint 


S = y - 1 <0 

(5) 

The initial conditions are x(0)=y(0)=0. The final value of x, x(tf)=x f is 
strictly positive, and the objective is to minimize the time required for x 
specified final value. Transformation (2) for the above system reads: 

specified and 
to reach this 

e“ + y - 1 = 0 

(6) 

Differentiating (6) with respect to time, and using (4) we obtain: 


e a aj + u = 0 

(7) 

where the new control variable otj is defined as: 


dot 

e_ dT =a i 

(8) 

The resulting equivalent unconstrained system is: 


dx = 1 _ e a _ e 2a a 2 
dt 1 

(9) 

da 

e ir =a i 

(10) 


with initial conditions x(0)=0, a(0)=0, and final condition x(tf)=x f >0. The Hamiltonian for 
system (9), (10) is: 


H = Xj (l - e“ - e 2a ctj) + X a aj + 1 = 0 
resulting in the optimality condition: 

=-2X x e 2a a 1 + X ot = 0 


( 11 ) 

( 12 ) 
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and the adjoint equations: 


X^ = con stant 


(13) 


e^ = X I e«(l + 2e«a;) (14) 

Using e=0 we find that there is only one possible reduced solution for the above system, 
given by: 

a® = 0, X.° = 0, a 0 = - °°, X. x = - 1, x°(t) = t, y° = 1, u° = 0 (15) 

Thus, as expected, in the the (x,a) plane the reduced solution corresponds to an arc at 
infinity. This arc is singular because the coefficient of oq vanishes along it in the optimality 
condition (12). 


At this point we are ready to make our major approximation. Instead of using (15) 
itself, we will approximate the singular arc at infinity by the nonsingular arc: 


a 0 = oto 


(16) 


where a 0 is a strictly negative finite number the value of which is left to our selection. 
Note that as oq tends to -«*> our reduced solution in the (x,y) plane rides the state constraint. 
Note also that even for moderate magnitudes of oq our reduced solution in the (x,y) plane 
will practically ride the state constraint. For example if we select oq=-3 then from (6) y° is 
approximately 0.95, which is very close to 1. The important point here is of course the fact 
that the arc given by (16) is a non-singular arc, which implies the possibility of an 
asymptotic boundary layer expansion about it. In summary, if we approximate the singular 
arc at infinity by the non-singular arc given by (16), we can easily verify that the exact 
reduced solution given in (15) will have to be approximated as: 


0 n -v 0 n 0 -4° 

a i = 0, X. a = 0, a = a 0 , X. x = 


a. 


- 1 


x°(t) = (l-e“°)t, y° = 




u° = 0 


(17) 


where we define y 0 as: 


y 0 = 


1 - e 


«o 


(18) 


We are now ready to examine the possibility of an asymptotic boundary layer 
expansion about the approximate reduced solution given in (17). Stretching the time scale 
near t=0 by: 


T = 


1 

e 


(19) 


the boundary layer equations for t near 0 can be written as: 

da 

— = a, 
dx 1 


(20) 
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^ = (^)°( 1+2 '° a? ) < 21 > 

The Hamiltonian evaluated in the boundary layer is given as: 

H BL = ^ — j(l - e° - e 2 «aj) + X OL a l + 1=0 (22) 

The boundary layer system can therefore be stated in the equivalent form of an optimization 
problem: 

minimize: J = | Y — - — Vl - e“ - e 2ct af) dx z f free (23) 

oU“°-lJ V 


subject to: -r- = ctj a(0)=0 (24) 

Note that our approximate reduced solution given in (17) serves as an approximate 
equilibrium solution of the boundary layer system (20), (21). In order to linearize the 
boundary layer system about the reduced solution given in (17) we define: 

5a = a-a° = a- a 0 (25) 


5a j = a, - af = aj 


(26) 


Then, the linearized boundary layer system can be stated in the equivalent form of the 
optimization problem: 

minimize: J = y / f ^H^8a 2 + 2H I ^ x 8a8a 1 + H BL a 8a 2 ^dx x f free (27) 

subject to: = §« 5a(0)=-Oo (28) 

ax 1 

where the second partials of H BL with respect to a and aj, evaluated along the approximate 
reduced solution (17) are found to be: 


h bl - 

H aa ~ 


H 


BL 

aa 


da 2 

= 3 2 H BL 
i 3a3aj 


3 


1 - e 


= 0 


«o 


H 


BL 

a i“i 


3 2 H BL 2e 2a ° 


Bat 


1 - e 


“o 


(29) 

(30) 

(31) 


The adjoint equation associated with (28) is: 


d(8X. a ) bl c „bl „ 

^ — H aa 8a H aaj 8aj 


(32) 
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where SA^ is defined as: 


6 A, a — Xa — A- (x — A^x 

The optimal control 5ocj is determined from the optimality condition: 

H^Sa+H^ 5a 1 + 5X„=0 


(33) 


(34) 


Solving (34) for 5a!, substituting into (28) and (32), and taking into account (29) through 
(31) results in the Hamiltonian system: 


d(5a) ( 

r a 0 
e 0 

dx 

. 2 

\ 

v 2e 

d(6A, a ) 

< 

e 

dx 

ve** 0 


5X„ 


5a 


(35) 


(36) 


The eigenvalues associated with this system are: 

_J5» 

e 2 


Sl= V2 


h = 


__^o 

e 2 

VT 


(37) 


Note that as a 0 tends to minus infinity the eigenvalues in (37) tend to plus or minus 
infinity, reasserting the fact that at this limit the boundary layer degenerates to a step 
function. 

The solution of system (35), (36) can be found using standard methods. First, in order 
to cancel the instability arising from Si the initial value of will have to be selected as: 


5A a (0) = 


3«o 
2 a 0 e 2 


V2(e“»- l) 

Then, the solution of (35), (36) can be shown to be: 


(38) 


6a(x) = - Oq exp i 


a„ 


i e 2 


yPi 


(39) 


3a f 


5A a (x) = 


2 OqC 2 

V2 (e 00 - l) 


exp 


xe 2 


(40) 
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The optimal control 5a t is determined from the optimality condition (34), using (29) 
through (31), (39), and (40): 


«o 


_^o 

xe 2 




(41) 


Comparing (41) with (39), 6aj can also be written in the form of a linear feedback control 
law: 


5a i(t) 



(42) 


Using now (25), (26), and (33), and replacing x by t/e, the composite solution in the (x,a) 
plane, resulting from the linearized asymptotic boundary layer expansion (39) through (42) 
can be found to be: 


a(t) = « 0 


1 - exp j 


te 2 


eV2 J 


(43) 


aj(t) 


a 0 e 2 

V 2 



an 


t e 2 


V2 J 


(44) 


MO = 


3« o 
2a 0 e 2 

V2 (e“» - l) 


a r 


exp ) 


t e 2 

c a/2 J 


The linear feedback control law found in (42) now becomes: 


(45) 


aj(t) = - 


__^o 

^=-(a(t)-a 0 ) 


(46) 


Note that as the boundary layer time t/e tends to infinity, a, aj and X a approach their 
(approximate) reduced solution values given in (17). Note also that as ocq tends to -°°, the 
solutions given in (43) through (45) degenerate into step functions. 

In order to transform the above solution to the (x,y) plane we use (6), (7), and (18). 
The result for y(t) is: 
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y(t) = 1 - (1 - y 0 ) 


-'4 3 *^)} 


(47) 


The linear feedback control law in (46) is now transformed to the nonlinear feedback 
control law: 


u(t) = 


V2(i-y 0 ) ^ 1-y ° ' 


(48) 


Note that since y 0 is less than 1, the nonlinear feedback control law in (48) is able to handle 
two-sided perturbations about y 0 , as long as these perturbations remain small and do not 
exceed 1. To complete the solution in the (x,y) plane we will also have to have the solution 
for x(t). But this is just the reduced solution found in (17): 


x(t) = x°(t) = (l-e a °)t 


(49) 


Our approximate minimum final time is the time at which x assumes its final value x f , 
which from (49) is: 




(1 - e” 0 ) 


(50) 


Note that as % tends to tf tends to its exact value 1 x f , which comes as a confirmation of 
the near-optimality of our procedure. 


In order to assess further this near-optimality we will now try to find the solution to 
the nonlinear boundary layer equations ((23), (24)) in the (x,a) plane, and then compare it 
with the exact solution given in reference 2 as oq tends to -«>. 


The optimality condition for the nonlinear case is obtained by differentiating the 
boundary layer Hamiltonian given in (22) with respect to oq: 


0H BL 

do^ 


2 e 2ct a 


a n 


- 1 


— h A, a — 0 


(51) 


Substituting for ^ from (51) into (22) we obtain: 




(52) 


Substituting for 04 from (52) into (24) results in a nonlinear differential equation for a: 

(w) 2 = a <°>=° (53) 


Using ( 6 ), and (18) we can show that (53) is equivalent to: 
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y(0)=o 


(54) 



Note that a solution does not exist, unless y is less than y 0 . Assuming that this is the case, 
and taking into account that in order for y to reach 1 dy/dx near t=0 will have to be positive, 
we can show that the unique solution of (54) is: 

y(i) = - x + < 55 > 


From reference 1 we can easily verify that the exact nonlinear boundary layer solution is: 

y(x)=-^+t (56) 

Comparing (55) with (56), and taking into account (18), we see that as oq tends to -«», y 0 
tends to 1, and our solution (55) tends to the exact solution (56) as expected. 

In order to derive a nonlinear feedback control law we now use (4), (19), and (54) 
which results in: 

ud) = (y 0 -y(*)) 1/2 (57) 


From reference 1 the nonlinear feedback control law that corresponds to the exact nonlinear 
boundary layer solution is: 


u(x) = (1 -y(x)) ' (58) 

A comparison of (57) with (58) shows that as yo tends to 1 the two control laws become 
identical as expected. 


Comments 

From a practical point of view the most important product of a procedure such as the 
above would be the two feedback control laws found in (48), and (57). Although both laws 
are nonlinear, there are important differences between the two. Specifically, the law 
supplied in (48) was found by using the linearized boundary layer system in the (x,a) 
plane. The law supplied by (57) on the other hand was found by using the full nonlinear 
boundary layer system in the (x,a) plane. In order to keep this distinction clear we will 
from now on use the notations LBL and NBL to refer to the linearized and nonlinear 
boundary layer systems respectively. Thus, (48) will be referred to as the LBL law and 
(57) will be referred to as the NBL law. The striking difference between the two laws is 
that the LBL law is asymptotic, while the NBL law is finite time. This can be concluded by 
examining (47) and (55), representing the solutions for y that the two laws give rise to 
respectively. In (47) y reaches y 0 as the time x=t/e goes to infinity. In (55) y reaches y 0 at 
the time Xf=2(y 0 ) 1/2 which in turn tends to the exact value 1 Xf=2 as yo tends to 1. The 
asymptoticity of the LBL law makes it attractive from a practical viewpoint. Moreover, and 
this is the single most important point of the present work, the LBL law can handle two- 
sided perturbations about yo, while the NBL law can't. However, as yo tends to 1 the NBL 
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law tends to the exact feedback control law given in reference 1, while the LBL law breaks 
down. The reason for this breakdown is that at this limit the linearized boundary layer 
expansion in the (x,a) plane is about the singular arc at infinity, supplying no new 
information over that supplied by the reduced solution. This also suggests that the 
composite solution obtained using the LBL equations in the (x,a) plane cannot be a 
uniformly valid approximation to the exact solution as yo tends to 1. 

The ability of the LBL law to handle two-sided perturbations about y 0 is the major 
justification for our introduction of the above procedure. Our procedure is nearly-optimal in 
the same sense that procedures employing feedback linearization are nearly-optimal, that is, 
because we track a nearly-optimal reduced solution that usually dominates the performance 
index if the final time is large compared to the characteristic times of the boundary layers. 
However, our procedure is not as ad hoc as feedback linearization, and, more importantly, 
the LBL law is nonlinear in the state perturbations, suggesting that there might be a hope 
of avoiding control saturation problems that are sometimes encountered when using 
feedback linearization. Of course, the major setback in our procedure is the fact that it 
cannot supply us with a composite solution that uniformly approximates the exact solution. 
Thus, the question of combining the classical asymptotic boundary layer theory with 
optimal control problems involving active state-variable inequality constraints still remains 
unresolved. 


15 



Appendix B 


16 



Draft Paper for AIAA Guidance, Navigation and Control Conference 
Monterey, CA, August 9-11, 1993 


Near -Optimal, Asymptotic Tracking in Control Problems 
Involving State-Variable Inequality Constraints 


N. Markopoulos*, and A. J. Calise** 
Georgia Institute of Technology 
Aerospace Engineering 
Atlanta, GA 30332 


Abstract 

A transformation technique is used that splits the class of all piecewise continuous (in 
time) controllers that track a given hypersurface in the state space of a dynamical system 
into two disjoint classes. The first class contains all controllers that track the hypersurface 
in finite time. The second class contains all controllers that track the hypersurface 
asymptotically. Four theorems are presented that describe the two classes completely. The 
results are applied to the study of optimal control problems involving active state variable 
inequality constraints. The controllers obtained from the traditional formulation of such 
problems suffer from serious practical flaws. They typically tend to track the hypersurface 
representing a state constraint boundary in a finite time, which makes the traditional 
asymptotic boundary layer theory non-applicable, and they break down whenever a 
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disturbance causes the system to violate the active state constraint. As an alternative, this 
paper proposes a reformulation of such problems in which the optimization is earned out 
with respect to asymptotic controllers only and not with respect to all controllers. If a 
minimum over all controllers exists, then this reformulated problem is guaranteed to have 
an infimum. An example suggests that the reformulated problem does not achieve its 
infimum, that is, the infimum corresponds to a finite-time controller. By imposing an 
additional isoperimetric constraint on the reformulated problem it is shown that one can, at 
least for this particular example, select an asymptotic controller that approximates this 
infimum. 


I. Introduction 

State-variable inequality constraints are commonly encountered in the study of 
dynamic systems. The study of rigid body aircraft dynamics and control is certainly no 
exception. For instance, a maximum allowable value of dynamic pressure is usually 
prescribed for aircraft with supersonic capability. This limit is required to ensure that the 
vehicle's structural integrity is maintained. Given a typical state-space description of the 
vehicle dynamics, this limit constitutes an inequality constraint on the vehicle state. Such 
dynamic pressure bounds are commonly encountered during fuel-optimal climb for 
supersonic transports 1 , for rocket powered launch vehicles such as the U.S. space shuttle 2 , 
and for single-stage-to-orbit air-breathing launch vehicles 3 . 

State-variable inequality constraints have been studied extensively by researchers in the 
field of optimal control. First-order necessary conditions for optimality when general 
functions of the state are constrained have been obtained 4 * 6 . However, the direct 
construction of solutions via this set of conditions proves difficult. Moreover, the 
controllers derived from such traditional formulations of the problem suffer from serious 
practical flaws. They typically tend to track the hypersurface representing a state constraint 
boundary in a finite time, which makes the traditional asymptotic boundary layer theory 



non-applicable (see below), and they break down whenever a disturbance causes the 
system to violate an active state constraint Accordingly, most practitioners seeking an open 
loop control solution rely on direct approaches to optimization that employ penalty 
functions for satisfaction of state-variable inequality constraints . 7 As a rule however, 
algorithms employing such methods are computationally intense and slow to converge. 
Consequently, they are not well suited for real-time implementation. 

From a Singular Perturbations point of view 8 - 9 , in the absence of a state-variable 
inequality constraint (i.e. when no constraint is active), the initial boundary layer solution 
for the class of systems being considered is an infinite time process. A solution is sought 
which asymtotically approaches the reduced solution. However, when a state constraint is 
active in the reduced solution, the boundary layer problem can be of finite time in the 
stretched time variable 10 - 11 . Thus, traditional techniques concerning the asymptotic stability 
of the boundary layer system are not applicable, and cannot be used to construct an 
approximate boundary layer solution. The presence of an active state- variable inequality 
constraint also introduces the possibility of discontinuous costate variables at the juncture 
between constrained and unconstrained arcs. A Valentine transformation can be used to 
convert the constrained problem to an equivalent unconstrained problem of increased 
dimension 12 . Smoothness is regained in the process, but at the expense of introducing a 
singular arc along the state constraint boundary 12 - 13 and to little or no advantage when 
seeking a solution for real-time implementation. 

As an alternative, this paper proposes a complete reformulation of optimal control 
problems involving active state variable inequality constraints. Since in practice it is always 
the asymptotic controllers that have the most desirable properties, maybe the optimization 
of such problems should be carried out not over the class of all controllers, but only over 
the class of asymptotic controllers that track a given active state constraint boundary. It is 
shown in Section n that a transformation technique can be used to isolate and describe 
completely this class of asymptotic controllers. If a minimum over the class of all 



controllers exists, then the reformulated problem is guaranteed to have an infimum. Our 
results in Section HI suggest however that this infimum for the reformulated problem 
corresponds to a finite-time controller and is not achieved over the class of all asymptotic 
controllers. The situation is somewhat reminiscent of H-infinity control theory for linear 
systems in which we seek a proper, stabilizing controller to minimize the H-infinity norm 
of a closed- loop transfer function. The minimum of this norm over all proper, stabilizing 
controllers does not exist. Its infimum however does exist and corresponds to an improper 
controller. Thus, just as in H-infinity theory, the question arises naturally in our case as to 
how we can find an asymptotic controller that somehow approximates this infimum. 
Although there are no general answers yet, a procedure is presented in Section ID that does 
supply us with an answer at least for a simple example. 

II. Construction of arbitrary nonlinear feedback control laws for a 
dynamical system that tracks a given hypersurface 

Consider the dynamical system: 

■^- = f(x,y, u) x(t 0 )=x 0 (!) 

dy 

-^- = g(x,y, u) y(t 0 ) =y 0 (2) 

where x, f are vectors of the same dimension, and y, g and u are scalars. It will be of 
interest to describe the set Cof all piecewise continuous (in time) control laws u(x(t),y(t),t) 
that track a given hypersurface in the state space of the above system, given by the scalar 
equation: 

S(x, y) = 0 (3) 

that is, if u=u(x,y,t) is a specific control law belonging to C, then the system (1), (2), 
driven by u(x,y,t) for t > to (and assuming that S(x 0 ,y 0 ) is not zero) will eventually reach 


the hypersurface given in (3) and stay on it thereafter. A control law can drive the system 
onto the hypersurface (3) either in finite time or asymptotically. Accordingly, the set C is 
the union of two disjoint sets 7 and The set 7 contains all control laws that track 
hypersurface (3) in finite time. The set & contains all control laws that track hypersurface 
(3) asymptotically. Our purpose in this section is to give a complete description of the sets 
7 and A. 

We denote by Z the set of all piecewise differentiable, scalar functions of the real 

variable a, defined and invertible for all a in [0,1], and satisfying the boundary conditions: 

z(0) = 0 ; z(l)=-S(x 0 ,y 0 ) (4) 

We assume that the range of S is contained in the range of z for all z in Z. We consider ct 

as a time variable and make the transformation from y to a: 

z(a) + S(x, y) =0 (5) 

Differentiating (5) with respect to time and using (1) and (2) results in: 

ll^ + § f(XiyiU)+ lf g(x>y>u)=0 (6> 

We now let da/dt play the role of a new control, (5, by defining: 

= P (?) 

dt K 

then, (6) becomes: 

^P + ff(x,y,u ) + |^g(x,y,«)=0 (8) 

We now assume that the hypersurface (3) is first order in u, that is, the total time derivative 
of S(x,y) is explicitly dependent on u. We also assume that (5) is invertible in y and that (8) 

is invertible in u, resulting in the two equations: 

y =h(x, z) 


( 9 ) 



for y and u respectively. System (1) and (2) has now been transformed through (5)-(10) to 
the equivalent system: 

y =f(x. h(x, z(a)), k(x, z(a), "j~P)) x (to) =x o O 1 ) 

$ = P «<>o)= 1 (12) 

Finite time tracking 

Theorem 1 : For any given finite time tf and any function z(a) in Z, there exists a 
control law u=u(x,y,t) in J that drives system (1), (2) from its initial state at t=to onto the 
hypersurface (3) at t=tf and keeps it on the hypersurface thereafter. 


Proof : Let t f be a finite time and z(a) be a function in Z. We can use z(a) to obtain the 
equivalent transformed system (11), (12). Then, we can use as our control P the function: 



for 

t 0 <t <tf 

(13) 

P = 0 

for 

t f <t 

(14) 

leading to the time variation for a: 

tf-* 
a= t _ t 

l f l o 

for 

t<> < t < tf 

(15) 

o 

II 

8 

for 

t f <t 

(16) 


Thus, a is driven from 1 to 0, in to < t < tf and stays at zero for tf < t. Accordingly, 
due to the boundary conditions (4) on the function z(a), our original system (1), (2), is 
driven from its initial state at t=to onto the hypersurface (3) in to < t < tf and stays on the 
hypersurface for t f < t. The feedback controller u(x,y,t) that will perform this task for 
system (1), (2) can be found from (10): P is given by (13), (14), z is equal to -S(x,y) from 
(5) and since z(a) is invertible on [0,1], dz/da can be expressed as a function of z and 
therefore of -S(x,y). The procedure is better illustrated through an example. 
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Example 1 

Consider the system: 


dx 2 

dr=y- u 


dy 

dt 


= u 


and assume that we want to track the line 

y - 1 = 0 

in a specified final time tf. As our function z(a) we select: 

z(a) = ( • i n 2° ) ln G + a ) 


x(to)=x 0 

(17) 

y( t o>=yo 

(18) 

0 

(19) 


( 20 ) 


note that z(0)=0, and z(l)=l-y 0 as required by (4). From (20) we are led to the 
transformation: 


(T^j ta < 1+a ) + y- 1=0 


( 21 ) 


Differentiating (21) with respect to time we obtain: 
resulting in the transformed system: 


dt F 


If we now use the control P given in (13), then, from (22) we obtain: 

. exp (-In 2^)1 

ln2(t f -t 0 ) 1 U-yoJi 


( 22 ) 


x(t 0 )=x 0 
ot(t 0 ) = 1 


(23) 

(24) 


( 25 ) 



This open loop control can be shown to drive system (17), (18) from its original state at 
t=to onto the line y=l at t=t f . To obtain a closed loop (feedback) controller we simply 
replace y 0 by y and to by t in (25), to obtain: 


i-y 

In 4 (t f - 1) 


(26) 


which again drives the system toward y=l at t=tf. 

Theorem 2 : Let u=u(x,y,t) be any control law in J that drives system (1), (2) from its 
initial state at t=to onto the hypersurface (3) at t=tf and keeps it on the hypersurface 
thereafter. Then, there exists a function z(a) in Z, such that, when system (1), (2) is 
transformed using z(a) to the system given by (1 1), (12), the control (3 for the transformed 

system corresponding to u(x,y,t) is given by (13), (14). 

Proof : The function z(a) for 0 < a < 1 is found by solving the system of differential 

equations: 

^ 7 =-(t f -t 0 )f(x,y, u(x, y,t)) ( 27 > 


dz 

da 




9S f(x, y, u(x, y, 0) + ^~g(x, y, u(x, y, t)) 


dx 


(28) 


subject to the boundary conditions: 

x(a = 1) = x 0 ; z(a= 1) =-S(x 0 , y 0 ) (29) 

where y in (27), (28) is a function of x and z through (9) and t in u(x,y,t) is a function of a 
through (15), that is, 

t = t f - a(t f -t 0 ) < 3 P> 

Note that the function z(a) found from the solution of (27) through (29) satisfies (5) 
for all t > to (a and t are related by (15)). Thus, it also satisfies the boundary condition 
z(0)=0, since at t = t f , a and S(x,y) are both equal to zero. 

The preceding two theorems tell us that once the control P for the transformed system 
(1 1), (12) is fixed as in (13), (14), there is a complete correspondence between functions in 
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Z and piecewise continuous control laws u(x,y,t) in 7 that track hypersurface (3) in a finite 
time tf. That is, for every element of Z there exists an element of 7 and, more importantly, 
for every element of 7 there exists an element of Z. The correspondence therefore 
established between Jand Z through the selection of (J as in (13), (14) is onto. 

Asymptotic tracking 

Theorem 3 : For any function z(a) in Z, there exists a control law u=u(x,y,t) in that 

drives system (1), (2) from its initial state at t=to onto the hypersurface (3) asymptotically. 

Proof : Let z(a) be a function in Z We use z(a) to obtain the equivalent transformed 

system (1 1), (12). We use as our control (5 the function: 

P = - a (31) 

leading to the exponential time variation for a: 

a = e -t (32) 

Thus, a is driven exponentially from 1 to 0. Accordingly, due to the boundary 
conditions (4) on the function z(a), our original system (1), (2), is driven asymptotically 
from its initial state at t=to onto the hypersurface (3). The feedback controller u(x,y,t) that 
will perform this task for system (1), (2) can be found from (10): P is given by (31), z is 
equal to -S(x,y) from (5) and since z(a) is invertible on [0,1], dz/da can be expressed as a 
function of z and therefore of -S(x,y). Again, we illustrate through an example. 


Example 2 



Consider again the system: 



dx 2 

df=y- u 

X 

/-N 

r* 

II 

X 

o 

(33) 

dy 

dt U 

II 

o 

(34) 


and assume that we want to track asymptotically the line 

y - 1 = 0 (35) 

As our function z(a) we again select: 

z(a)= (“hTT i J ln(1 + a) (36) 

satisfying z(0)=0, and z(l)=l-y 0 as required by (4). From (36) we are led to the 
transformation: 

(“EI i J ln ( 1 + a)+y_ l = 0 (37) 


Differentiating (37) with respect to time we obtain: 



+ u = 0 


(38) 


resulting in the transformed system: 







x(t 0 )=x 0 

(39) 

a(t 0 ) = l 

(40) 


If we now use the control P given in (31), then, from (38) we obtain: 


u = 


e k a-y)_i 


k e 


k( 1 - y ) 


(41) 


where k is defined as: 

k = T ^- (42) 

!-yo 


This open loop control can be shown to drive system (33), (34) from its original state at 
t=to asymptotically onto the line y=l. To obtain a closed loop (feedback) controller we 
simply replace y 0 by y in (41), to obtain: 



(43) 


1 1 


u = 


i-y 

ln4 


which again drives the system (33), (34) asymptotically toward y=l. Note that as expected, 
in the asymptotic case u does not depend explicitly on t. 

Theorem 4 : Let u=u(x,y,t) be any control law that drives system (1), (2) from its 
initial state at t=to onto the hypersurface (3) asymptotically. Then, there exists a function 
z(a) in Z, such that, when system (1), (2) is transformed using z(a) to the system given 
by (11), (12), the control P for the transformed system corresponding to u(x,y,t) is given 
by (31). 

Proof : The function z(a) for 0 < a < 1 is found by solving the system of differential 
equations: 

a^- =-f(x, y, u(x, y, t)) (44) 

“da = H f(x,y> u ( x »y> t )) + ^g(x,y, u(x,y,t)) (45) 


subject to the boundary conditions: 

x(a = 1) = x 0 ; z(a= 1) =-S(x 0 ,y 0 ) (46) 

where y in (44), (45) is a function of x and z through (9) and t in u(x,y,t) is a function of a 
through (32), that is, 

«=m) < 47 > 

Note that the function z(a) found from the solution of (44) through (46) satisfies (5) 
for all t > to (a and t are related by (32)). Thus, it also satisfies the boundary condition 
z(0)=0, since as t approaches infinity a and S(x,y) both tend to zero. 

Theorems 3, and 4 tell us that once the control P for the transformed system (11), (12) 
is fixed as in (31), there is a complete correspondence between functions in Z and 
piecewise continuous control laws u(x,y,t) in A that track hypersurface (3) asymptotically. 


That is, for every element of Z there exists an element of A and, more importantly, for 
every element of J? there exists an element of Z The correspondence therefore established 
between and Z through the selection of P as in (3 1) is onto. 

III. Significance for optimal control problems involving active 
state- variable inequality constraints 

The ideas presented in the last section can be appropriately utilized in the study of 
optimal control problems involving active state-variable inequality constraints. A common 
feature of such problems is that when a portion of the optimal trajectory rides the 
hypersurface representing a state constraint boundary, the optimal transition to this 
hypersurface from an initial point that does not lie on it occurs in finite time. Consequently, 
the corresponding optimal feedback controllers for such problems, that can be obtained by 
well-known analytical or numerical methods, suffer from two flaws that tend to eliminate 
almost completely their practical usefulness. First, such feedback controllers are finite-time, 
meaning that, the traditional asymptotic boundary layer theory is not applicable. Second, if 
a disturbance throws the system instantaneously toward the prohibited side of the 
hypersurface, the feedback scheme breaks down and there is no "optimal" way of returning 
to the hypersurface 10 . 

Asymptotic controllers on the other hand, capable of tracking from both sides a 
hypersurface representing a state constraint boundary, presumably won't suffer from either 
of the above two flaws. This observation, and the ideas presented in the last section 
suggest that once we know that a portion of the optimal trajectory for a problem rides such 
a hypersurface, we can change our point of view and try to optimize the system over all 
asymptotic controllers capable of tracking that hypersurface from both sides. 

Consider therefore that we are to minimize the performance index: 



13 


•r 

j=<D[x(t f ),y(t f )]+ J L (x,y, u)dt (48) 

«o 

subject to the dynamical equations, 

f--f(x.y,u) (49) 

= g(x. y. u) (50) 

the boundary conditions, 

x ( t o) = x o» y(t 0 ) = yo ; *o flxed » l f tee (51) 

and the state variable inequality constraint: 

S(x, y) ^ 0 (52) 


As in the last section, x, f arc vectors of the same dimension, and L, S, y, g and u are 
scalars. We will assume that 

S(x 0 , y 0 ) <0 ( 53 ) 

and that the optimal trajectory reaches the hypersurface 

S(x, y) = 0 (54) 

at a finite time t=t x and stays on it for t > tj. Thus, in order to avoid the problems with the 
finite-time controllers mentioned above we would now like to optimize J over the class of 
all asymptotic controllers, which we have already denoted by A, capable of tracking 
hypersurface (54) from both sides. In order to formulate this problem we first use an 
arbitrary function z(a) from the set of functions Z defined in the last section to transform 
our problem as follows: 

Our transformation reads: 

z(a) + S(x, y) =0 (55) 

with the function z(a) subject to the boundary conditions: 



z(0) = 0 ; 


z(l) — S(x 0 , y 0 ) 


(56) 


1 4 


This leads as in the last section (see equations (5) through (12)) to: 


^ P+ Hf(x, y, u) + ff-gU, y, U) = 0 


(57) 

and to the transformed system: 



y = h(x, z) 


(58) 

u = k(x, z, p) 


(59) 

f-=f(x, h(x, z(a)), k(x, z(a) , g^P)) 

x(t 0 ) =x 0 

(60) 

¥=P 

dt H 

a(t 0 )= 1 

(61) 

The performance index to be minimized assumes the form: 







j= <i>[x(t f ),h(x(t f ), z(a(t f )))]+ Jl(x, h(x,z), k(x, z, ^p))dt (62) 

*0 


As we saw in the last section (Theorems 3 and 4), once the control (3 for the 
transformed system (60), (61) is fixed at P=-a, there is a complete correspondence 
between functions z(a) in Z and piecewise continuous control laws u(x,y,t) in that track 
hypersurface (3) asymptotically. That is, for every element of Z there exists an element of 
A and, more importantly, for every element of & there exists an element of Z To find the 
"best" asymptotic controller therefore it is natural to fix the control P at 

P = -a (63) 

and then try to determine the "best" function z(a) in Zthat minimizes J. This leads directly 
to the off-line optimization problem: Minimize 

J 1 = <J>[x(0),h(x(0), 0)] + , h(x, z), k(x, z, -<xv))d<x (64) 

o 
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subject to: 

^ =- (-i-)f(x, h(x, z),k(x, z,-av» x(l)=x 0 (65) 

^=v z(0) = 0 ; z(l) =-S(x 0 , y 0 ) (66) 

We can see from (64) that for every function z(a) in Z there corresponds a number Jj(z). 

Thus, we can define the set of real numbers: 

3- [ Jj(z) : z is in Z} (67) 

and state our off-line optimization problem (64) through (66) by asking for the minimum of 
3. Although we do not know at the present if and when such a minimum exists, we can 
state the following theorem that provides us with an important partial answer: 

Theorem 5 : If the original problem (48)-(52) involving an active state variable 
inequality constraint has a minimum, then the off-line optimization problem (64)- (66) has 
an infimum. 

Proof : Assume that problem (48)-(52) has a minimum, which we denote by J min . 
This immediately implies that the set 3 is bounded below by J^, that is, ^ Jj(z) for 
all z in Z. Since any set of real numbers that is bounded below has an infimum, 3 has an 
infimum. 

Example 3 

To illustrate the above idea we will apply it to the following problem: Minimize 

J = J(l - y + u 2 ) dt (68) 

o 


subject to: 


dy 

dt -u ; 


y(0) =0 


(69) 



y - 1 £0 


(70) 
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The solution to this problem 


for 0 < t < 2 can be shown to be 10 : 

.2 


y = --x + t ; 


u= -i + i 


(71) 


leading to the finite-time controller: 

>> = (i-y) 1/2 (72) 

I 

Although this controller is optimal, it is clearly useless from a practical point of view since 
it breaks down when y exceeds 1. It cannot be used to track the line y=l in the presence of 
two-sided perturbations about y=l. At t = 2 y reaches the value y = 1 . For t > 2, y and u 
stay constant at 1 and 0 respectively and there is no more contribution to the performance 
index J. In order to optimize J over all asymptotic controllers that track y = 1, we now 
make the transformation: 

z(a) + y - 1=0 (73) 

z(0) = 0 ; z(l) = l (74) 


which leads to: 

4^- P+ u = 0 (75) 

da K 

and to the transformed optimization problem: Minimize 

(76) 

subject to: 

^=p a (0) = 1 (77) 

Using now the control: 

P = - a (78) 


leads to the off-line optimization problem for the function z(a): Minimize 
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J i = J(l" +av2 ) da 


(79) 


subject to: 


dz 

da 


= v z(0) = 0 ; z(l) = l 


The Hamiltonian associated with this problem is: 


H = ^ +av 2 +Xv 


(80) 


(81) 


which results in the optimality condition for v: 


v = — 


2a 


and in the adjoint equation for X: 


6X _L 

da a 


(82) 


(83) 


The solution for X is: 


X = ln 


(84) 


where A is an integration constant. Combining (82) and (84) with (80) we obtain the 
differential equation for z: 

(85> 


which has the general solution: 


« ± 0 - 1*1 ) 2 


+ B 


( 86 ) 


B being a second integration constant. A quick inspection now reveals that there are no 
values of A, B, for which the function z(a) in (86) can satisfy both of the boundary 
conditions z(0)=0, z(l)=l. The off-line optimization problem therefore posed in (79), (80) 
has no minimum. As guaranteed however by Theorem 5, it does have an infimum, since it 
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is bounded below by where, is the minimum value of the performance index in 
(68), that is, 

J min= 2 ]( 1 + T _t ) dt= T (87) 

In order to see what kind of controller u the function z(a) found in (86) implies, we use 
(75), (78), and (85) to find: 

u=-^(t+ln|A|) (88) 

Comparing with (71), we see that (88) implies the optimal, finite-time controller found 
before. Indeed, (88) can be shown to lead to (71) if the boundary conditions y(0)=0 and 
y(2)=l are imposed and the equation dy/dt=u is integrated. There is no contradiction 
however with either Theorem 3 or Theorem 4, since there is no function z in Z 
corresponding to the finite-time controller u given in (88). 

The above result implies that the minimum value of J, Jmin^/3, found in (87) is not 
only a lower bound of J! but the actual infimum itself. This situation is reminiscent of H- 
infinity control theory in which we seek a proper, stabilizing controller to minimize the H- 
infinity norm of a closed-loop transfer function. The minimum of this norm over all proper, 
stabilizing controllers does not exist. Its infimum however does exist and corresponds to an 
improper controller. Thus, just as in H-infinity, the question arises naturally in our case as 
to how we can find an asymptotic controller that somehow approximates this infimum. 
Although there are no general answers yet, we will present a procedure that does supply us 
with an answer at least for the above example: 

First we can show using (85) that for the off-line optimization problem (79), (80), the 


integral 


l v2d “ = |(^r) ln2 |al da 


(89) 



representing the "total energy" stored in the signal v(a) diverges. This suggests that if we 
impose the isoperimetric constraint 


1 

/v 2 da = k 

o 


(90) 


on the off-line optimization problem (79), (80), where k is a given, finite, strictly positive 
number, we may have a hope of finding a function z(a) in Z, that is, one that does satisfy 
the boundary conditions z(0)=0 and z(l)=l. The Hamiltonian associated with this new 
problem (79), (80), and (90) will read: 

H= ^- + (^t + a) v 2 + kv (91) 


where |i is a constant Lagrange multiplier. For each value of k there corresponds a specific 
value of (X and vice versa. The corresponding optimality condition for v now becomes: 


X 

2(p. + a) 


(92) 


The adjoint equation for X remains unchanged as in (83) and leads to the same solution for 
X given in (84), while the differential equation for z now reads: 


dz _ 1 . A 

da 2(n + a) m a 


(93) 


(93) leads to the solution: 


a 


<(a)=-4 J 


2 i(l*+0 


In 


dk 


(94) 


satisfying the boundary condition z(0)=0. The boundary condition z(l)=l is satisfied by 
the choice of A (as a function of |i) that guarantees that: 


i = -i! 


i 


2 


In 


A 

$ 


dt. 


(95) 


finally, the particular value of |i is evaluated from (90) once a value for k has been 
specified. That the improper integrals on the right-hand-sides of (94), (95) exist will be 



argued upon more rigorously in the final paper. Here we will supply a small indication to 
convince the reader that this is indeed so: Consider the value of k corresponding to the 
value |i=l. Then, the integral on the right-hand-side of (95) can be evaluated easily from 
any Table of Definite Integrals 14 , and (95) can be solved for A to yield: 


In | A| = - 


(24 +7C 2 ) 
12 ln2 


(96) 


The corresponding asymptotic controller u can be found from (75), (77), (78), (93), (96) 
and the fact that we set p=l. The result is: 


u = 


*11 f. 

2(1+ e-0^ 


24 + n 2 
12 ln2 



(97) 


It can be shown that this controller drives y asymptotically from y=0 at t=0 to y=l as t 
approaches infinity. This fact however is already guaranteed by Theorem 3. To construct 
an asymptotic feedback controller, one will have to use (97) to integrate the equation 
dy/dt=u. This will lead to an expression for y as a function of t. Elimination of t between 
this expression and (97) will lead to the desired asymptotic feedback controller. Or, 
alternatively, one can use the transformation (73) with z(a) supplied by (94) and express a 
as a function of y. Then, using (93) and (75) with P=-a should result in the same 
asymptotic feedback controller as before. Although the procedure cannot be carried out 
analytically for either case, it can be carried out numerically and relevant results and details 
will be given in the final paper. 


IV. Conclusions 

The class of all piecewise continuous (in time) controllers that track a given 
hypersurface in the state space of a dynamical system can be split into two disjoint classes. 
The first class contains all controllers that track the hypersurface in finite time. The second 
class contains all controllers that track the hypersurface asymptotically. A transformation 



technique can be used to give a complete description of both classes. This splitting of the 
two classes can be used to reformulate optimal control problems involving active state 
variable inequality constraints. The optimization in the reformulated problem is carried out 
over the class of asymptotic controllers only and not over the class of all controllers. If a 
minimum over all controllers exist, then the reformulated problem is guaranteed to have an 
infimum. An example suggests that the reformulated problem does not achieve its infimum, 
that is, the infimum corresponds to a finite-time controller. By imposing an additional 
isoperimetric constraint on the reformulated problem one can, at least for this particular 
example, select an asymptotic controller that approximates this infimum. 
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